Variable time-scale modification of speech using transient information

Authors

  • Sung-Joo Lee
  • Hee-Dong Kim
  • Hyung Soon Kim
Abstract

Conventional time-scale modification methods share a problem: as the modification rate increases, the time-scale modified speech becomes less intelligible, because these methods ignore the effect of articulation rate on speech characteristics. In this paper, we propose a variable time-scale modification method based on the knowledge that the timing information of transient portions of a speech signal plays an important role in speech perception. After identifying the transient and steady portions of a speech signal, the proposed method reaches the target rate by modifying the steady portions only. The results of a subjective preference test indicate that the proposed method outperforms the conventional SOLA method.
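The idea of reaching the target rate through the steady portions alone can be illustrated with a small calculation. This is a minimal sketch, not the authors' algorithm: it assumes that "rate" means input duration divided by output duration, that transient portions are left at their original duration, and that the transient and steady durations are already known. The function name `steady_rate` and its parameters are hypothetical.

```python
def steady_rate(total_dur, transient_dur, target_rate):
    """Rate to apply to the steady portions so that the whole utterance
    reaches target_rate while transient portions keep their duration.

    Assumption (not from the paper): rate = input duration / output duration,
    so a rate above 1 speeds speech up and a rate below 1 slows it down.
    """
    if target_rate <= 0:
        raise ValueError("target_rate must be positive")
    if not 0 <= transient_dur <= total_dur:
        raise ValueError("transient_dur must lie between 0 and total_dur")

    steady_dur = total_dur - transient_dur
    # Output duration required for the whole utterance.
    out_total = total_dur / target_rate
    # Transients are copied unchanged, so the steady part must fill the rest.
    out_steady = out_total - transient_dur
    if out_steady <= 0:
        raise ValueError("target rate cannot be reached by modifying steady portions only")
    return steady_dur / out_steady


# Hypothetical example: 10 s of speech with 2 s of transients, played twice as fast.
rate = steady_rate(10.0, 2.0, 2.0)
```

Under these assumptions the steady portions have to be compressed more strongly than the overall target rate, and a target is unreachable once the required output duration drops below the total duration of the transients.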

Similar resources

Computationally Effic Modification of Speech Us

Among the conventional time-scale modification methods [1][6], the synchronized overlap and add (SOLA) method [4] is used widely because of its good performance with relatively low computational complexity. But the SOLA method still requires much computation in evaluating the normalized cross-correlation function for the synchronization procedure [9]. In this paper, we employ 3-level center clipping...


Transient Natural Convection in an Enclosure with Variable Thermal Expansion Coefficient and Nanofluid Properties

Transient natural convection is numerically investigated in an enclosure using variable thermal conductivity, viscosity, and the thermal expansion coefficient of Al2O3-water nanofluid. The study has been conducted for a wide range of Rayleigh numbers (10^3 ≤ Ra ≤ 10^6), concentrations of nanoparticles (0% ≤ ϕ ≤ 7%), the enclosure aspect ratio (AR = 1), and temperature differences between the cold a...


Practical high-quality speech and voice synthesis using fixed frame rate ABS/OLA sinusoidal modeling

This paper describes algorithms developed to apply the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal modeling system to real-time speech and singing voice synthesis. As originally proposed, the ABS/OLA system is limited to unidirectional timescaling, and relies on variable frame length to accomplish time-scale modification. For speech and voice synthesis applications, unidirectional ti...


A Speaking Rate Normalization Method Using Time-Scale Modification for Speech Recognition

In this paper, we propose a speaking rate normalization method by selecting a scaling factor of time-scale modification for speech recognition. It is shown from the speech recognition experiments that the proposed method reduces average word error rate compared to that without using any speaking rate normalization.


Effects of Pitch Contours Stylization and Time Scale Modification on Natural Speech Synthesis

This paper describes the method of generation of intonated speech for natural speech synthesis using a prosody generation model. The effect of pitch modification through pitch contour stylization for parameter extraction and time scale modification for its implementation has been mentioned. An approach for close-copy syllabic stylization has been described. In the latter part, algorithm for impl...



Publication date: 1997